Objective


• Characterize the relationship between disease outbreaks
and environmental and meteorological parameters

• Use the relationship to forecast disease outbreaks 

• Disease applications: 

- Seasonal and pandemic influenza, malaria, dengue 


Schematic Approach 


• Data Center (e.g. NASA GES DISC; NOAA NCDC; USGS Earth Explorer, etc.)

- Multi-temporal earth science products: satellite-derived products, ground observations

• Data processing

- Data continuity and integrity checking

- Additional variable/indicator derivation

- Spatiotemporal aggregation

• Weekly meteorological time series, combined with weekly influenza epidemiological data

• Modeling

- Regression model

- Neural Network

- Decision Tree

- Agent-Based model

• Model calibration and validation

• Final Output: Meteorological dependency, climate-based influenza forecast







Meteorological Data Processing 


• Epidemiological and virological surveillance data are typically 
aggregated 

- Spatially: district, provincial or national level 

- Temporally: weekly or monthly 

• Satellite data processing 

- Projection; masking region of interest; spatial and temporal averaging; 
data imputation 

• Ground station processing 

- Spatial and temporal averaging; data imputation 




Create lag variables 
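
The temporal averaging, data imputation and lag-variable steps listed above can be sketched as follows. This is a minimal illustration, not the processing chain actually used: the function names, the 7-day block definition of a week, and the choice of linear interpolation for imputation are all assumptions made for the example.

```python
from statistics import mean

def weekly_means(daily):
    """Average daily values into consecutive 7-day blocks, skipping missing (None) days."""
    weeks = []
    for start in range(0, len(daily) - len(daily) % 7, 7):
        valid = [v for v in daily[start:start + 7] if v is not None]
        weeks.append(mean(valid) if valid else None)
    return weeks

def impute_linear(series):
    """Fill interior gaps (None) by linear interpolation between known neighbours.
    Assumption: linear interpolation; leading/trailing gaps are left as None."""
    out = list(series)
    known = [i for i, v in enumerate(out) if v is not None]
    for a, b in zip(known, known[1:]):
        for i in range(a + 1, b):
            out[i] = out[a] + (out[b] - out[a]) * (i - a) / (b - a)
    return out

def lagged(series, lag):
    """Shift a weekly series by `lag` weeks; the first `lag` entries become None."""
    return [None] * lag + list(series[:len(series) - lag])

def lag_window_mean(series, first_lag, last_lag):
    """Mean over lags first_lag..last_lag, e.g. (1, 3) = average of the previous 1-3 weeks."""
    out = []
    for t in range(len(series)):
        if t - last_lag < 0:
            out.append(None)  # not enough history yet
            continue
        window = [series[t - k] for k in range(first_lag, last_lag + 1)]
        out.append(None if None in window else mean(window))
    return out

# Synthetic daily values: week 2 is entirely missing
daily = [float(d) for d in range(1, 8)] + [None] * 7 + [float(d) for d in range(15, 22)]
weekly = weekly_means(daily)             # week 2 is None
filled = impute_linear(weekly)           # week 2 interpolated from weeks 1 and 3
lag1 = lagged(filled, 1)                 # previous week's value
prev_1_2 = lag_window_mean(filled, 1, 2) # average of the previous 1-2 weeks
```

The lag-window helper mirrors the "previous 1-3 weeks average" style of predictor; which window to use is a modeling choice.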


Meteorological Data Processing 


Internal database of satellite data for epidemiological analysis 

• Six satellite data products 

• Spatial and temporal aggregation capabilities 


[Screenshot: database search and request interface]

Available products:

- Aerosol Optical Depth at 0.47 µm, 0.55 µm and 0.66 µm

- Land Surface Temperature (Day)

- Land Surface Temperature (Night)

- Normalized Difference Vegetation Index

- Accumulated Rainfall (Daily)

- Rainfall Rate Estimate (Monthly)

Example product record: Land Surface Temperature (Day)

- Product: MOD11C1

- Temporal Resolution: Daily

- Spatial Resolution: 0.05 degree

- Geospatial Coverage: Global

- Start of Data: 2000-03-05

- End of Data: 2011-08-24

Request options: time span (2000-03-05 to 2011-08-24); area selection by coordinates (N, S, E, W) or by region (e.g. Afghanistan, ALL); spatial integration (split points or average points; current or maximum depth); desired temporal resolution; date preference (group tag or tag each point).




[Screenshot: sample output file (cambodia_lstday_monthly1) opened in Notepad]

File header:

- Land Surface Temperature (Day) [MOD11C1]

- Created on: 2011-09-28

- Original filename: cambodia_lstday_daily1.txt

- Temporal Coverage: 2000-12-31 - 2011-04-02

- Temporal Resolution: Daily

- Spatial Coverage: Cambodia => ALL (9.91361°N => 14.688171°N; 102.335502°E => 107.629989°E)

- Spatial Resolution: 0.05

- Null/Fill value: 0

- Timestamps are contained alone on a line and apply to all following data

- Columns: <latitude> <longitude> <LST_day Kelvin>

- The data has been split into regions and averaged

Sample data (Cambodia, Svay Rieng): one value per timestamp, e.g. 2000-12-31/000000Z 301.677662 and 2001-01-01/000000Z 303.688078, continuing at monthly steps through 2004-07-01, with values between roughly 297 K and 308 K.





Influenza: The Problem 


Latitudinal variation of seasonal influenza 
epidemics 

- Temperate region: distinct annual peak in winter 

- Tropical region: less distinct seasonality, multiple peaks 

Southward migration in Brazil 

- From sparsely populated areas in the tropics to densely populated areas with a
temperate climate

Suggests a role for environmental and
meteorological factors

- Several meteorological parameters have been implicated
in influenza outbreaks

- Temperate region: low temperature and humidity 

- Tropical region: rainfall in several countries 


Viboud et al. (2006).
PLoS Medicine 3:e89
Example: Influenza In Central America

[Map: numbered study locations in Guatemala, El Salvador and Panama]

1 San Marcos

2 Quezaltenango

3 Guatemala

4 Santa Rosa

5 Santa Ana

6 La Libertad

7 San Salvador

8 Cuscatlan

9 San Miguel

10 Chiriqui

11 Panama


Soebiyanto RP, Clara W, Jara J, Castillo L, et al. (2014) The Role of Temperature and Humidity on Seasonal Influenza in Tropical
Areas: Guatemala, El Salvador and Panama, 2008-2013. PLoS ONE 9(6): e100659.



Meteorological Data 


Data Source 

- Tropical Rainfall Measuring Mission (TRMM): Daily resolution at 0.25° (~ 25 km) 

- Global Land Data Assimilation System (GLDAS): 3-hourly resolution at 0.25° (~ 25 km) 

Precipitation: TRMM 

Near Surface Temperature: GLDAS 

Near Surface Specific Humidity: GLDAS 


Meteorological data processing 


Multi-temporal (daily) precipitation rate (TRMM) from Giovanni, followed by spatio-temporal aggregation, gives a rainfall (mm/day) time series for each study location.

[Figure: rainfall (mm/day) time series, one panel per location: Guatemala, Central departments; Guatemala, Western departments; El Salvador, San Miguel; El Salvador, West-central departments; Panama, Chiriqui; Panama, Panama]
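
The spatio-temporal aggregation of a gridded product can be sketched as below. This is only an illustration under stated assumptions: the region of interest is approximated by a latitude/longitude bounding box (real department boundaries would need a polygon mask), grid cells are weighted by the cosine of latitude, and weeks are taken as consecutive 7-day blocks. The function names `regional_mean` and `weekly_series` are hypothetical.

```python
import numpy as np

def regional_mean(grid, lats, lons, box, fill_value=None):
    """Area-weighted mean of a (lat, lon) grid inside box = (south, north, west, east)."""
    south, north, west, east = box
    lat_in = (lats >= south) & (lats <= north)
    lon_in = (lons >= west) & (lons <= east)
    sub = grid[np.ix_(lat_in, lon_in)].astype(float)
    # Cells shrink toward the poles on a regular lat/lon grid: weight by cos(latitude)
    w = np.cos(np.deg2rad(lats[lat_in]))[:, None] * np.ones(int(lon_in.sum()))
    valid = ~np.isnan(sub)
    if fill_value is not None:
        valid &= sub != fill_value  # drop cells flagged with the product's fill value
    if not valid.any():
        return float("nan")
    return float((sub[valid] * w[valid]).sum() / w[valid].sum())

def weekly_series(daily_grids, lats, lons, box, fill_value=None):
    """Regional mean per day, then the mean of each consecutive 7-day block."""
    daily = np.array([regional_mean(g, lats, lons, box, fill_value) for g in daily_grids])
    n_weeks = len(daily) // 7
    return np.nanmean(daily[: n_weeks * 7].reshape(n_weeks, 7), axis=1)

# Synthetic 0.25-degree grid; the last latitude row lies outside the box
lats = np.array([13.0, 13.25, 13.5, 13.75])
lons = np.array([-89.5, -89.25, -89.0])
box = (13.0, 13.5, -89.5, -89.0)
day = np.full((4, 3), 2.0)
day[3, :] = 100.0                                 # outside the region, must be ignored
grids = [day * 0.5] * 7 + [day * 1.5] * 7         # two weeks of daily rainfall (mm/day)
weekly_rain = weekly_series(grids, lats, lons, box)
```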


Regression Modeling 


Logistic regression 

Y_{kt} \sim \mathrm{Bin}(N_{kt}, p_{kt})

Y_{kt} is the number of samples that tested positive for influenza virus in location k at week t;
N_{kt} is the total number of samples collected/processed from location k at week t; p_{kt} is the influenza-positive proportion, Y_{kt} / N_{kt}.

The logit of the influenza-positive proportion is defined as:

z_{kt} = \ln\left( \frac{p_{kt}}{1 - p_{kt}} \right)

The full model can be written as:

z_{kt} = \beta_0 + \sum_{j=1}^{3} \beta_j x_{jkt} + \sum_{l} \gamma_l a_{k,t-l} + \sum_{m} \delta_m v_{mkt} + \sum_{n} \theta_n w_t^{n}

where

- \beta, \gamma, \delta, \theta: regression coefficients to be estimated

- x_{jkt}: meteorological variable (i.e. temperature, humidity, rainfall)

- a_{k,t-l}: previous weeks' influenza activity

- v_{mkt}: co-circulating viruses (RSV, adenoviruses) as confounding factors

- w_t^{n}: polynomial function of the week number
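
To make the logistic regression concrete, here is a minimal sketch of fitting a binomial logit model and turning a coefficient into an odds ratio with a 95% confidence interval. It is not the authors' implementation: the fitting routine (iteratively reweighted least squares written with NumPy), the function name `fit_binomial_logit`, the single centred humidity predictor, and the synthetic noise-free counts are all assumptions for illustration.

```python
import numpy as np

def fit_binomial_logit(X, y, n, n_iter=25):
    """Fit logit(p) = b0 + X @ b for y ~ Bin(n, p) by Newton's method (IRLS).
    Returns the coefficients (intercept first) and their standard errors."""
    X = np.column_stack([np.ones(len(y)), X])    # add the intercept column
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        W = n * p * (1.0 - p)                     # binomial variance weights
        grad = X.T @ (y - n * p)                  # score vector
        info = X.T @ (X * W[:, None])             # Fisher information
        beta = beta + np.linalg.solve(info, grad)
    p = 1.0 / (1.0 + np.exp(-X @ beta))
    W = n * p * (1.0 - p)
    cov = np.linalg.inv(X.T @ (X * W[:, None]))   # asymptotic covariance of beta
    return beta, np.sqrt(np.diag(cov))

# Synthetic example: weekly specific humidity (g/kg), 50 samples tested per week
humidity = np.linspace(10.0, 20.0, 40)
x = humidity - humidity.mean()                    # centred predictor
n = np.full(40, 50.0)
true_beta = np.array([-1.0, -0.24])               # assumed values, for illustration only
y = n / (1.0 + np.exp(-(true_beta[0] + true_beta[1] * x)))  # noise-free expected counts

beta, se = fit_binomial_logit(x[:, None], y, n)
odds_ratio = np.exp(beta[1])                      # odds ratio per 1 g/kg increase
ci = np.exp(beta[1] + np.array([-1.96, 1.96]) * se[1])      # 95% Wald interval
```

An odds ratio below 1 means the odds of a positive sample fall as the variable rises; an interval that excludes 1 corresponds to significance at roughly the 0.05 level.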


Results: Estimated Coefficients 


Adjusted Odds Ratio (95% Confidence Interval) per meteorological variable, meteorological variable averaging period, and prediction performance (RMSE, Corr. Coeff):

Country and Province         Temperature (°C)    Specific Humidity (g/kg)   Rainfall (mm/day)   Average Period       RMSE   Corr. Coeff
Guatemala
  Central departments        1.01 (0.88, 1.15)   0.79 (0.69, 0.91)          1.05 (1.01, 1.09)   Prev. 1-3 wks ave.   0.08   0.12
  Western departments        0.94 (0.80, 1.11)   0.72 (0.60, 0.86)          1.01 (0.98, 1.04)   Prev. 0-1 wks ave.   0.13   0.08
El Salvador
  West-central departments   0.80 (0.70, 0.91)   1.18 (1.07, 1.31)          1.00 (0.99, 1.02)   Prev. 1 wk ave.      0.06   0.50
  San Miguel                 1.28 (0.99, 1.65)   1.32 (1.08, 1.63)          0.98 (0.92, 1.05)   Prev. 1-2 wks ave.   0.13   0.02
Panama
  Chiriqui                   1.30 (0.85, 2.02)   1.97 (1.34, 2.93)          0.95 (0.87, 1.04)   Prev. 0-3 wks ave.   0.11   0.73
  Panama                     1.13 (0.80, 1.61)   1.44 (1.08, 1.93)          1.10 (1.05, 1.14)   Prev. 1-2 wks ave.   0.07   0.90

In the original table, bold font indicates a statistically significant variable (p-value < 0.05). RMSE is the Root Mean Squared Error and Corr. Coeff is the correlation coefficient between the
observed and estimated influenza positive proportion in 2013.

The models were adjusted for: potentially confounding variables (RSV, parainfluenza and adenoviruses), previous weeks' influenza positivity, seasonality and other
possible nonlinear relationships (modeled as a polynomial function, up to degree 3, of the week number).
doi:10.1371/journal.pone.0100659.t002
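
The two prediction metrics reported in the table, RMSE and the correlation coefficient between observed and estimated positive proportion, can be computed as in this short sketch. The function names and the sample values are made up for illustration and are not data from the study; the correlation is assumed to be Pearson's.

```python
import math

def rmse(obs, est):
    """Root mean squared error between observed and estimated values."""
    return math.sqrt(sum((o - e) ** 2 for o, e in zip(obs, est)) / len(obs))

def pearson_r(obs, est):
    """Pearson correlation coefficient between observed and estimated values."""
    mo, me = sum(obs) / len(obs), sum(est) / len(est)
    cov = sum((o - mo) * (e - me) for o, e in zip(obs, est))
    so = math.sqrt(sum((o - mo) ** 2 for o in obs))
    se = math.sqrt(sum((e - me) ** 2 for e in est))
    return cov / (so * se)

# Hypothetical weekly influenza-positive proportions
observed = [0.10, 0.20, 0.30, 0.40]
estimated = [0.20, 0.30, 0.40, 0.50]   # constant bias of 0.10, same shape
error = rmse(observed, estimated)
corr = pearson_r(observed, estimated)
```

The example shows why both metrics are reported: a forecast can track the shape of the epidemic curve perfectly (high correlation) while still being biased (non-zero RMSE), and the reverse can also happen.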


Specific humidity was consistently associated with influenza activity in all study locations,
but the direction of the association differed:

Inverse relationship in Guatemala (odds ratios below 1) and proportional relationship in the other locations (odds ratios above 1)


Results: Training and Prediction 


[Figure: influenza positive proportion per location, shown with temperature (Celsius), specific humidity (g/kg) and rainfall (mm/day). Panels: Guatemala, Central departments; Guatemala, Western departments; El Salvador, San Miguel; El Salvador, West-central departments; Panama, Chiriqui; Panama, Panama. Legend: modeled training data, prediction, observation.]


Neural Network 


An artificial intelligence method that mimics the functioning of the brain

Artificial neuron

- f(·) can be a sigmoid function, radial basis function, etc.

[Figure: network diagram]

- Input layer: Temperature, Precipitation, Humidity

- Hidden layer

- Output layer: Influenza Activity (Proportion of Weekly Positive)
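
A forward pass through such a network can be sketched as follows. This is an illustrative sketch only: the single hidden layer of four neurons, the sigmoid activation on both layers, the random weights and the standardized input values are assumptions, and no training step is shown.

```python
import numpy as np

def sigmoid(a):
    """Logistic activation f(a) = 1 / (1 + exp(-a))."""
    return 1.0 / (1.0 + np.exp(-a))

def forward(x, W1, b1, w2, b2):
    """One hidden layer: each artificial neuron applies f to a weighted sum of its inputs."""
    hidden = sigmoid(W1 @ x + b1)            # hidden-layer activations
    return float(sigmoid(w2 @ hidden + b2))  # output in (0, 1): weekly positive proportion

rng = np.random.default_rng(0)
W1 = rng.normal(size=(4, 3))   # 3 inputs -> 4 hidden neurons (arbitrary size)
b1 = rng.normal(size=4)
w2 = rng.normal(size=4)        # 4 hidden neurons -> 1 output
b2 = 0.0

# Standardized temperature, precipitation and humidity for one week (made-up values)
x = np.array([0.5, -1.0, 0.2])
activity = forward(x, W1, b1, w2, b2)
```

A sigmoid output keeps the prediction between 0 and 1, which suits a proportion; in practice the weights would be learned from the weekly surveillance and meteorological series.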


Neural Network Example 


Neural Network (NN) and ARIMA outputs for New York City and Maricopa County (AZ)

[Figure: time series of data, ARIMA and NN outputs; panel title: MARICOPA COUNTY]

Model   Input                             RMSE (Fit/Pred)   R² (Fit/Pred)
ARIMA   Mean Dew Pt (4)                   0.046/0.022       0.311/0.795
NN      TMAX (1), Rain (3), TMIN (2)      0.044/0.0036      0.731/0.584

ARIMA   RHMAX (3), LST (3)                0.575/0.5493      0.911/0.941
NN      RHMIN (1), TEMP (4), SOLAR (4)    0.608/1.089       0.820/0.754

The NN model shows that ~60% of influenza variability in the US regions can be accounted for by
meteorological factors



Summary: Challenges 


Meteorological Data and Processing 

• Changes in or heterogeneity of: location, formats, algorithm, 
availability (data continuity) 

• Storage capacity 

• Data products validation 

Uncovering patterns & modeling 

• Choice of mathematical and statistical models 

• Each model rests on assumptions, so results and predictions must be
interpreted in light of those assumptions

• Parameter constraints and prediction validation 